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Deinococcus phoenicis strain 1P10ME T is a radiation- and desiccation-resistant bacterium isolated from a cleanroom facility 
where the Phoenix Lander spacecraft was assembled. In order to facilitate investigations of the nature of the extreme resistance 
of D. phoenicis to bactericidal factors, a draft genome sequence of D. phoenicis was determined. 
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The high flux of radiation encountered during space travel and 
on the surfaces of Mars and Europa poses a significant chal- 
lenge for the survival of microbial life. Nevertheless, some micro- 
organisms can survive the extreme radiation and low temperature 
experienced during space travel and, therefore, might be trans- 
ferred to these solar system bodies and even proliferate. One such 
organism is Deinococcus phoenicis, a pale-pink pigmented, tetrad- 
forming coccoid bacterial strain (1P10ME T ) that was originally 
isolated from the Phoenix spacecraft assembly cleanroom (1). 
Presently, the genus Deinococcus comprises 50 validly published 
species (http://www.bacterio.net/deinococcus.html) and repre- 
sents a phylogenetically diverse group. Polytaxonomic character- 
ization of thelP10ME T isolate revealed that it was only distantly 
related to previously described Deinococcus species ( 1 ). Neverthe- 
less, D. phoenicis cells exhibited levels of resistance to gamma (D 10 , 
>8 kGy) and ultraviolet (D 10 , > 1,000 J m~ 2 ) radiation that are 
comparable to those exhibited by D. radiodurans (2). The possible 
occurrence of a D. phoenicis population on spacecraft surfaces 
potentially poses a risk to the integrity of future life detection and 
sample return missions. Whole-genome sequencing was under- 
taken to better understand this organism so that spacecraft clean- 
ing practices might be modified to eliminate it. 

In this study, we determined the draft genome sequence of 
D. phoenicis strain 1P10ME T . Whole-genome shotgun sequencing 
was performed on an Illumina HiSeq 2000 instrument with a 
paired-end module at the University of Arizona Genetic Core 
Facility (Tucson, AZ). A total of 14,784,897 pairs of 100- 
nucleotide-long reads were collected. The reads were trimmed 
to remove terminal low-quality nucleotides and assembled us- 
ing Velvet 1.2.10 (3) bundled with VelvetOptimiser 2.2.5 (http: 
//bioinformatics. net.au/software.velvetoptimiser.shtml) . This re- 
sulted in 94 contigs with a G+C content of 68.96% and sizes 
ranging from 209 to 232,991 bp (with a mean length of 40,501 bp), 
containing 3,807,137 bp in total. Further scaffolding of the contigs 
yielded 66 scaffolds encompassing 3,809,008 bp, with the largest 
having 353,669 bp. Protein-coding sequences (CDSs) were pre- 
dicted using Glimmer 3.02 (4) and Prodigal 2.60 (5). tRNA genes 



were searched using tRNAscan-SE (6), while other RNA genes 
were found by homology search using NCBI-BLAST-2.2.29 + (7). 
A total of 3,558 CDSs, 47 tRNA genes, a transfer-messenger RNA 
gene, and an RNase P RNA gene were identified. In addition, 
complete 5S rRNA and 16S rRNA and partial 23S rRNA gene 
sequences were obtained. The predicted CDSs were evaluated us- 
ing BLAST against a collection of protein genes from seven com- 
plete deinococcal genomes available at the time of this writing 
(Deinococcus deserti, D. geothermalis, D. gobiensis, D. maricopensis, 
D. peraridilitoris, D. proteolyticus, and D. radiodurans). Of a core 
set of 1,032 protein genes shared by all these genomes, only one 
was not found in the draft D. phoenicis genome assembly, namely 
a gene encoding nitric oxide synthase-like protein (DR_2597 in 
D. radiodurans). Overall, 2,654 predicted CDSs were mapped sat- 
isfactorily to deinococcal genes. 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited at DDBL/EMBL/GenBank 
under the accession no. JHAC00000000. The version described in 
this paper is version JHAC0 1000000. The D. phoenicis strain has 
been deposited in the USDA Agricultural Research Station (NRRL 
B-59546 T ) and German (DSM 27173 T ) culture collections. 
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